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Abstract 

The paper aims at analyzing the least squares ranking method for generalized tournaments with 
possible missing and multiple paired comparisons. The bilateral relationships may reflect the out¬ 
comes of a sport competition, product comparisons, or evaluation of political candidates and policies. 
It is shown that the rating vector can be obtained as a limit point of an iterative process based on the 
scores in almost all cases. The calculation is interpreted on an undirected graph with loops attached 
to some nodes, revealing that the procedure takes into account not only the given object’s results 
but also the strength of objects compared with it. We explore the connection between this method 
and another procedure defined for ranking the nodes in a digraph, the positional power measure. 
The decomposition of the least squares solution offers a number of ways to modify the method. 

Keywords: Preference aggregation. Paired comparison. Ranking, Least squares method, Laplacian 
matrix 


1 Introduction 

Ranking of alternatives is becoming an important tool for individuals, enterprises and nonprofit or¬ 
ganizations to help their decision making processes. In various contexts the necessary information is 
available as outcomes of paired comparisons of the objects. Problems of this kind appear in social choice 
theory, statistics (Eltetd and Koves, 1964; Szulc, 1964), sports (Landau, 1895, 1914; Zermelo, 1928) or 
psychology (Thurstone, 1927), to name a few. 

There exist two fundamentally different approaches in ranking methodology. The first one seeks 
various scoring functions, giving a weight or valuation to all alternatives, that is, they compress the 
paired comparison matrix into a single rating vector. The second approach is based on the approximation 
of the (generalized) tournament by linear orders (Kemeny, 1959; Slater, 1961), which usually leads to 
interesting combinatorial and algorithmic problems (Hudry, 2009). From a theoretical viewpoint these 
methods have two great disadvantages: the possible occurrence of multiple optimal solutions and the 
difficulties arising in the examination of their (normative) properties (Bouyssou, 2004). Consequently, 
we will follow the former approach. 

Score seems to be the most obvious rating method: it is obtained by adding the number of victories 
for each object. It is an appropriate choice in the case of complete tournaments such that all objects 
have set against each other at the same number of occasions. However, there are many situations where 
it is unfeasible to get direct information about each pair of alternatives. It implies that the schedule 
becomes important since an object compared with weak opponents may score more victories than its 
peers facing stronger objects. In this case, for example for Swiss system tournaments, the application of 
scores in order to rank the objects can be questioned (Csato, 2013). 

In order to take into account the quality of opponents, a large number of scoring procedures have been 
suggested, see, for instance, Chebotarev and Shamis (1998a) for a survey of them. Chebotarev and Shamis 
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(1999) introduces two classes, the win-loss combining and win-loss unifying procedures, to categorize the 
methods proposed in the literature. Win-loss combining procedures can usually be calculated iteratively 
on a graph, where the vertices represent the alternatives and the edges reflect the paired comparisons. 
Among them, PageRank is one of the most popular (Brin and Page, 1998). Slikker et al. (2012) in¬ 
tegrated its core, the invariant method (Daniels, 1969; Moon and Pullman, 1970) as well as fair bets 
(Daniels, 1969; Moon and Pullman, 1970) and A (Borm et ah, 2002) methods into a single framework, 
and interpreted them on graphs. Herings et al. (2005) dehnes another method, the positional power 
function for ranking the nodes of directed graphs. 

For the other class of win-loss unifying procedures, graph interpretation becomes more difficult since 
they treat wins and losses uniformly, therefore the direction of edges do not count. We know only 
one result in this field, the iterative calculation of a subclass of the generalized row sum (Chebotarev, 
1989, 1994), a parametric family of ranking methods (Shamis, 1994). This paper gives a graph inter¬ 
pretation for the least squares method (Horst, 1932; Morrissey, 1955; Mosteller, 1951; Gulliksen, 1956; 
Kaiser and Serlin, 1978) through the use of scores and the comparison structure. 

The iterative calculation is similar to the positional power (Herings et al., 2005), which is somewhat 
surprising, since the least squares method is defined as an optimization problem, not as an intuition- 
based proposal. However, there are two main differences besides the applied approach (win-loss unifying 
vs win-loss combining): the role of initial scores and iterated ratings in the calculation, and the choice 
of the parameter reflecting the importance of successors in digraphs or objects compared with the given 
one. 

The graph interpretation also provides a lot of possibilities to modify it, in order to eliminate its 
drawbacks from an axiomatic viewpoint (Gonzalez-Diaz et al., 2014). We hope that simple calculation 
and evident connection with the scores can inspire practical applications, as well as offer an alternative 
to the extensively used PageRank method in certain cases. Nevertheless, it should be taken into account 
that scientometrics differs from our tournament context since a citation from a journal to another is 
advantageous for the latter, but not necessarily unfavourable for the former. 

The paper is organized as follows. The setting is presented in Section 2, where two scoring procedures 
are introduced, too. Section 3 deals with the least squares method, for which a new iterative solution 
method is given. It is used for the decomposition of the rating vector leading to the graph interpretation, 
discussed in Section 4. Finally, Section 5 concludes our results. 


2 Notations and rating methods 

Let N = {Al, ^ 2 ,..., A„}, n e N be a set of objects and R = ..., m e N be an array 

representing the outcomes of paired comparisons between the objects, where (p = 1,2,... ,m) is 
an n X n nonnegative matrix corresponding to the pth experiment, round of tournament, questionnary 
etc. Matrices R^^^ may be defined partially, and remain unknown if objects Xi and Xj were not 
compared in the pth round. A fully defined matrix R^p^ is called a complete paired comparison matrix, 
while the one with some ’missing’ elements is incomplete. For all pairs of objects compared [Xi,Xj), 
r= 1 is assumed, rcan be interpreted as the likelihood assigned to the event Xi is better than 
Xj in the pth round of a tournament. Diagonal elements ru are supposed to be 0 for alH = 1, 2,..., n, 
but they will not be used for the ranking methods discussed. 

Most scoring methods are based on the aggregated paired comparison matrix R = (r^) containing 
the sum of the results for all pairs of objects: 


0 

/ V _1 „(P) _j ' id 


if r^j'^ is not defined for every p 
otherwise. 


1, 2, .. . , TO 


Generally, the outcomes can be aggregated by taking a weighted sum, thus associating different 
weights on various rounds/experts/areas etc. It makes sense in forecasting sport results, when the latest 
paired comparisons are considered to be more important. 

The pair {N, R) is called a preference profile. The set of preference profiles is TZ. This setting is 
able to integrate four extra features in addition to those of binary tournaments (complete, weak and 
asymmetric binary relations, see (Rubinstein, 1980)): 
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• the possibility of ties: = rji\ 

• different preference intensities captured by the likelihood 'Tijlivij + fji); 

• incompleteness as is undefined (unknown) for some pairs of objects 

• multiple comparisons between objects: is known for more than one p. 

For example, if only strict binary relations are allowed, then S {0,1} for all p = 1, 2,..., m. This 
notation follows Chebotarev and Shamis (1998a) and Gonzalez-Di'az et al. (2014). 

Regarding the four extensions of the original binary tournament model, the possibility of ties is 
an immediate consequence of different preference intensities, a common feature in many situations. 
Multiple comparisons can arise naturally from the definition of preference profile. It can be supposed 
that incompleteness does not appear in an ideal case; however, we want to allow an expert to avoid 
judgement if he/she is not familiar with the two alternatives. The lack of direct information about 
paired comparisons may arise when the problem contains a large number of objects or it is too expensive 
to compare each pair; the latter is the reason of the emergence of knockout or Swiss system tournaments 
in some sports. Another case of incomplete comparisons is the need for predicting the final ranking 
before all rounds of a round-robin tournament played. 

A rating (scoring) method / is an 7^ K" function, /j = /(iV, R)i is the rating of object Xi. It 
defines a corresponding ranking method (p, that is, a (transitive and complete) weak order such that the 
objects are arranged according to /: ip ranks Xi weakly above Xj if and only if fi > fj. It is denoted 
by Xi Xj or simply by Xi ^ Xj, if it is not misleading. This definition of ^ already determines that 
Xi is ranked strictly above Xj if and only if Xi is ranked weakly above Xj, but Xj is not ranked weakly 
above Xp. {Xi >~ Xj) [{Xi ^ Xj) and -^{Xj ^ Xi)]. Finally, the ranking can be tied between objects 
Xi and Xj-. Xi ~ Xj [{Xi >: Xj) and {Xj ^ Xi)]. Ratings give cardinal, and rankings give ordinal 
information about the objects. Throughout the paper, the notions of rating and ranking methods will 
be used analogously since the discussed ranking procedures are based on rating vectors. 

A scoring procedure is neutral if any reindexing of the objects in N preserves their rating. A scoring 
procedure is anonymous if any reindexing of the paired comparison matrices in an array R preserves 
the ratings of objects. It seems to be quite natural to demand neutrality and anonymity; all rating 
procedures discussed here will satisfy these conditions. Note that a method based on the aggregated 
paired comparison matrix R is always anonymous. Rating procedures fi and /2 are called equivalent if 
they result in the same ranking. 

Ranking of the objects involves two main challenges. The first one is common in all paired comparison 
models: the possible appearance of circular triads, when object Xi is better than Xj (that is, > rji), 
Xj is better than X^, but X^ is better than Xi. Circular triads generate difficulties in all paired 
comparison settings, but, if preference intensities also count, other triplets {Xi, Xj, Xk) may produce 
problems. The second issue arises as the consequence of incomplete and multiple comparisons: the 
performance of objects compared with Xi strongly influences the observable paired comparison outcomes 
rij. For example, if Xi was compared only with Xj, then its rating certainly should depend on the results 
of Xj . We will see that this argument can be continued infinitely. Since both problems can occur only 
if there is at least three objects, the case n = 2 becomes trivial. 

An alternative representation of paired comparisons is the following. The additive paired comparison 
matrix can be derived from by ’centering’ the outcomes of paired comparisons such that 
undefined comparisons is set to 0. Now is a skew-symmetric 

matrix. It is called consistent if = Oik + akj for all triplets {Xi, Xj, Xk), and inconsistent if this 
condition is not satisfied for some {Xi, Xj, Xk). The aggregated additive paired comparison matrix 
A = {ttij) is defined analogously by A = A^p\ and will be referred to as the results matrix. 

The numbers of comparisons between the objects determine the matches matrix M = {niij): 


Wy¬ 


the number of indices 1 < p < m such that r^^'^ is defined ii i ^ j 
0 if f = j. 


M is a symmetric matrix and 0 < Wy < m. It is not restrictive to assume that m = max^y- Wy if the 
reduced matrix A is analyzed. In most practical applications (and in our setting above) Wy G N, but 
the whole discussion is valid for my S M+ as well, this domain choice has no impact on the results. The 
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generalization has some significance for example in the above mentioned problem of forecasting sport 
results. Here the latest comparisons contain more information about the current form of the player, 
which may be addressed by exponential smoothing, a technique usually applied to time series data. 

The matches matrix M is called block diagonal and block anti-diagonal, respectively, if it has a 
partition 7Vi U iV 2 = N, |A^i| = ni and |A^ 2 | = n 2 such that with a possible reordering of the objects 

M = 

y ^n2Xni 

respectively, where the subscripts denote the dimensions of (sub)matrices. Furthermore, di = 
is the total number of comparisons of object Xi. 

Results matrix A and matches matrix M together with the set of objects N determine a ranking 
problem {N,A,M) or {A,M) for short. In this modified setting, (a^ rriij)/{2mij) G [0,1] may be 
regarded as the likelihood that object Xi defeats Xj. 

A ranking problem is called round-robin if = 1 for all i ^ j, that is, every object has been 
compared with all the others exactly once and dt = n — 1 for all i = 1, 2,..., n. A round-robin ranking 
problem is more general than the binary tournaments of Rubinstein (1980) as it allows for ties (a^ = 
Oji = 0) and preference intensities (a^ is not necessarily —1 or 1). A ranking problem is called unweighted 
if TOy G {0,1} for all i ^ j, namely, every paired comparison is carried out at most once. Otherwise the 
ranking problem is called weighted. 

Matrix M can be represented by an undirected multigraph G := {V, E) where vertex set V corresponds 
to the object set N, and the number of edges between objects Xi and Xj is equal to m^ . Therefore 
the set of edges represents the structure of known paired comparisons. The number of edges adjacent 
to Xi G is the degree di of node Xi. A path is a sequence of objects X/c^,Xfc 2 ,... ,Xkt such that 
> 0 for all £ = 1, 2,..., t — 1. Two vertices are connected if G contains a path between them. 
A graph is said to be connected if every pair of vertices is connected. The adjacency matrix of G is 
given with the elements tij = 1 if mij > 0 and tij = 0 otherwise. 

Graph G is called the comparison multigraph associated with the ranking problem (N, A, M), however, 
it is independent of the results of paired comparisons. The Laplacian matrix L = [(-ij], i,j = 1,2,... ,n 
of graph G is an n X n real matrix with £ij = —mij for all i ^ j and in = di for all i = 1, 2,..., n. L 
has real and nonnegative eigenvalues (it is positive semidefinite) (Mohar, 1991, Theorem 2.1), denoted 
by Ml > /r 2 > • • • > Mra-i ^ Mn = 0- Lof o S bo fbe unit column vector, that is, = 1 for all 
* = 1, 2,..., n. 

Now we define two rating methods for the ranking problem {N, A, M). 

Definition 2.1. Row sum rating method: s = = Ae. 

Row sum will also be referred to as scores, s is sometimes called the scores vector. The following para¬ 
metric rating procedure was constructed axiomatically by Chebotarev (1989) and thoroughly analyzed 
in Chebotarev (1994). 

Definition 2.2. Generalized row sum rating method: it is the unique solution x(e) of the system of 
linear equations (/ -I-£L)x(e) = (1 -|-£77in)s, where e > 0 is a parameter, s is the scores vector, I is the 
n X n identity matrix, and L is the Laplacian matrix of the comparison multigraph G. 

It follows from the definition that this procedure results in the row sum ranking if e = 0. For larger 
parameter values it adjusts the standard scores of objects by accounting for the performance of objects 
compared with it, and so on. e indicates the importance attributed to this correction of scores s. 

Both the score and the generalized row sum ratings are well-defined and easily computable from a 
system of linear equations for all ranking problems {A,M). 


flni Xn2 
-'"712 Xn2 


and M = 




. Ml 

'raiXni -'”nixrt2 

M^ n 

-'"712X711 "712X712 


3 The least squares method and its solution 

Another approach to ranking is the statistical estimation by identifying hij = 2aij j mij as the realized 
difference between the latent valuations of objects Xi and Xj. In the ideal case no randomness is present 
and there exists a rating vector q G K" such that hij = qi — qj for all pairs of objects {Xi, Xj). It requires 
the consistency of the results matrix A since 0 = {qi — qj) -\- {qj — qk) + {pk — Pi) = hij -|- hjk -\- hki for 
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Figure 1: The preference graph of Example 3.1 



all {Xi, Xj, Xk). If it is inconsistent, the actual outcome hij may differ from its ’expected value’ qi — qj, 
and it makes sense to apply the following least squared error minimization 

Xi,Xj£N 

This method was discussed by Horst (1932) and Mosteller (1951) in the round-robin case, and was 
extended to unweighted problems by Gulliksen (1956) and Kaiser and Serlin (1978). The weighted 
case is examined in Bozoki et al. (2014) and Gonzalez-Diaz et al. (2014), but it can also be regarded 
as unweighted by summation over indices i,j,p (Chebotarev and Shamis, 1999). Clearly, the problem 
has an infinite number of solutions because the value of the objective function is the same for q and 
q + /3e,/3 G R. A natural normalization is e^q = 0. The generalized row sum can be considered as a 
Bayesian modification of the least squares estimation (Chebotarev, 1994). 

The first-order conditions of optimality give the following system of equations with unconstrained 
variables G M for alH = 1, 2,..., n: 


/ di -mi2 -mi3 ... \ 


f 1 


/ Si \ 

-mi2 d2 -m23 .. ■ -m2,n-i -rrL2,n 


92 


S2 

-77131 -77723 ds ■ ■ • -m^.n 


93 

= 

S3 

777^—1,1 r77n—1,2 1,3 ■ ■ • dn — 1 l^n 


Qn—1 


^n—1 

\ ^n,l ^n,2 ^n,3 ■ ■ • ^n,n—1 dn j 


\ 9n J 


\ S„ / 


where di = denotes the total number of A^’s comparisons, and the element in the (*, j) position 

{i ^ j) of the coefficient matrix equals —rriij. On the right-hand side, Si = ^ij i® score of object 
Xi. Due to the convexity of the objective function, the system of linear equations is a sufficient condition 
for optimality. 

Note that the n x n matrix on the left-hand side is exactly the Laplacian matrix associated with the 
comparison multigraph, thus the first-order conditions give Lq = s. L has no inverse as sum of its rows 
(and columns) is zero. 

Definition 3.1. Least squares rating method: it is the solution q of the system of linear equations 
Lq = s and e^q = 0. 

Corollary 1. The least squares rating can be obtained as a limit of the generalized row sum method if 
e oo. 

Proof. See Chebotarev and Shamis (1998a, p. 326). □ 

Example 3.1. (Chebotarev, 1994, Example 1) Suppose that the graph on Figure 1 is a preference graph, 
reflecting the dominance relation between the objects: aij = mij = 1 if and only if there is an edge from 
Xi to Xj . 
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The corresponding results, matches matrices and the scores vector are as follows 
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The solution for the least squares method is 

q=[ 1.810 0.476 0.810 -0.524 -0.190 -1.524 -0.857 ]^. 

The scores method (s) does not show the strength of objects compared with the given one. However, it 
is strange to assume that Xi and have performed equally because the former has beaten the latter 
indirectly through X^. Least squares method results in the ordering Xi >- X 3 >- X 2 >- X^ >-■ X 4 >- X^ >- 
Xe. 


Gulliksen (1956, p. 127) notes for the unweighted case that, in general, the first minor of L has 
an inverse, which makes it possible to normalize the rating vector by = 0 and eliminate the last 
equation. After that, the upper-left (n — 1) x (n — 1) submatrix of L denoted by L_i is taken with the 
corresponding first n — 1 components of s and q, denoted by s_i and q_i, respectively. If [L_i]~ exists, 
then q_i = s_i. It means that the first n — 1 equations of the system Lq = s remain to be 

satisfied if = 0 is attached to q_i. The last equation is true because the sum of the first n — 1 rows 
of L is the opposite of the last, and similarly, the sum of the elements of s_i is equal to — s„. 

In a round-robin ranking problem [T-i] ^ can be computed explicitly, it is an (n — 1) x (n — 1) 
matrix with 2/n in each diagonal and 1/n in each off-diagonal entry (Gulliksen, 1956, p. 127). The 
unique solution is qi = = Si/n, implying that the row sum and least squares rankings 

coincide. This property is called score consistency {SCC) by Gonzalez-Dfaz et al. (2014). 

Some connections of the ranking problem and the associated Laplacian matrix L are worth mentioning 
here. 

Lemma 3.1. For a ranking problem {N, A, M), the following statements are equivalent: 

1. Matches matrix M is not block diagonal; 

2. Comparison multigraph G is connected; 

3. The second smallest eigenvalue jin-i of L is positive. 

Proof. The equivalence of items 1 and 2 is straightforward, since if M is block diagonal, then there is no 
edge between the set of objects A^i and N 2 and vice versa. 2 3 is proved in (Mohar, 1991, Theorem 

2.1): the multiplicity of the Laplacian eigenvalue = 0 is equal to the number of components of graph 
G. □ 

From the three conditions above connectedness of the comparison multigraph will be used in our 
discussion. If some properties are required of graph G, it means that only the appropriate subset of 
ranking problems {N, A, M) is considered. 

A graph G is called bipartite if its node set N can be divided into two disjoint subsets U and V 
such that every edge connects a vertex in U to one in V. Equivalently, a bipartite graph is a graph 
without odd-length circles. Notice that a similar lemma can be stated for the other special structure of 
the matches matrix: it is block anti-diagonal if and only if the comparison multigraph is bipartite. The 
equivalence is due to the fact that the objects can be divided into two groups without comparisons inside 
the groups. 

Intuitively, uniqueness of the least squares solution should be provided when all objects Xi and Xj 
can be compared directly or indirectly, that is, there exists a chain Xi = Xkg,Xkj,... ^Xj-, = Xj such 
that for each £ S {0,1,..., t — 1}, Xk^ has been compared with 
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Proposition 3.1. The least squares rating q is unique if and only if comparison multigraph G is con¬ 
nected. 

Proof. In the unweighted case, see Bozoki et al. (2010, Theorem 4). The same theorem was proved by 
Kaiser and Serlin (1978, p. 426) in a different way. 

The general weighted case is examined in Bozoki et al. (2014) and Gonzalez-Diaz et al. (2014). 
Chebotarev and Shamis (1999, p. 220) mention this fact without further discussion. □ 

If there is no relation between two groups of objects Ni and N 2 as graph G is not connected, then it 
seems strange to rank them on the same scale. 

Now another solution is given for the least squares problem based on n x n matrices and n-dimensional 
vectors. In this sense it differs from the proposals of Gulliksen (1956) and Bozoki et al. (2010), but it is 
similar to the approach of Kaiser and Serlin (1978). 

Let I be the n x n identity matrix as before, J be the n x n matrix of I’s and, with a slight abuse 
of notation, 0 be both the n x n matrix and the n-dimensional vector of O’s. We adjust the Laplacian 
matrix in order to eliminate its zero eigenvalue. 

Lemma 3.2. Let G be a connected comparison multigraph. Then pLn-i > 0, the matrix L -\- (1/n) J is 
nonsingular with eigenvalues /ii, p, 2 , ..., 1. If L'^ is the Moore-Penrose generalized inverse of L, 
then [L (1/n) J] ^ = L~^ {l/n)J. 

Proof. This formula is well-known in the literature, it has been rediscovered several times. The first 
appearance may be in Sharpe and Styan (1965), see also Rao and Mitra (1971, Theorem 10.1.2) and 
Ghebotarev and Agaev (2002, Propositions 15 and 16). Here we give a new proof. 

The unweighted case is discussed in Gutman and Xiao (2004, Theorems 4 and 5). 

For the general weighted version, pLn-i > 0 was proved in Lemma 3.1. It is always possible to 
choose the Laplacian eigenvectors to be real, normalized and mutually orthogonal. The eigenvec¬ 
tor corresponding to = 0 is of the form = [1,1,..., 1]^ , S M". Since the eigenvectors 
= 1, 2,..., n — 1, are orthogonal to = 0 is satisfied for all k = 1, 2,..., n — 1. 

Then the extension of the result of Gutman and Xiao (2004, Theorem 4) to multigraphs is obvious 
as [L-I-(1/n) J] -|- (l/n)Ju(^^ = -|- 0 = for all fc = l,2,...,n — 1 and 

[L (1/n) J] -I- (1/n) = 0 -|- (l/n)nu*^”^ = thus the last eigenvalue of L -|- (1/n) J 

is equal to 1 with the corresponding eigenvector . 

Kwiesielewicz (1996) shows that LL+ = L+L — I — {\/n)J is provided in the weighted case, too. It 
implies that [L -\- (1/n) J] [L^ -\- (1/n) J] = LL^ -|- 0 -f- 0 -|- (l/n)^J^ = I — (1/n) J -I- (l/n)^nJ = I, since 
JL'^ = 0 (Kwiesielewicz, 1996, Theorem 4), consequently [L -\- (1/n) J]~^ = L+ -|- (l/n)J. □ 

Kaiser and Serlin (1978) use the matrix L -|- J in the unweighted case to circumvent the singularity 
of L. 

Theorem 3.1. Let G he a connected comparison multigraph. The unique solution of the least squares 
problem is 

q = =[L (1/n) J]~^ s. 

Proof. Lemma 3.2 provides the following equivalent transformations of the least squares problem: 

Lq = s <t=> [L -|- (1/n) J] q = s -|- (1/n) Jq O q = [L'*' -|- (1/n) J] s -|- (1/n) [L'^ (1/n) j] Jq. 

Since L~^J = 0 (Kwiesielewicz, 1996, Theorem 4), = nJ, and Js = (X]r=i = 0 due to e^s = 

Er=iS.=0: 

Lq = s q = L+s -|- (1/n) Jq. 

Normalization Jq = 0 can only be done if JL+s = 0, which is satisfied because JL+ =0. □ 

This solution concept resolves the problem of the singularity of L, while simple calculation is preserved 
since L+ can be obtained through the identity L+L = LL+ = I — (l/n)J. Theorem 3.1 is mainly a 
technical result, it means a step towards the iterative calculation of the least squares method. 
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4 The iterative calculation of the least squares rating 

In this section an iterative process is given for the calculation of the optimal least squares solution. 
Gulliksen (1956) offers a similar approach but his method is based on the choice of an arbitrary solution 
q and adjusting it according to the error term s — Lq. Our proposal starts with the first estimation 
of ratings by the row sums s which will be updated by the scores of objects compared with it, and 
so on. It is similar to the rating method called recursive Buchholz, defined on the aggregated paired 
comparison matrix R (Gonzalez-Diaz et ah, 2014). However, the latter uses an ’average’ setting, the 
modified scores vector s' and matches matrix M' after division by the number of comparisons di = 
for each Xi. Interestingly, despite the different approach, the recursive Buchholz ranking 
coincides with the one obtained from the least squares solution, it gives a rating vector which is the half 
of q (Gonzalez-Di'az et ah, 2014, Proposition 3.1). 

Recursive Buchholz is a special case of the recursive performance defined by Brozos-Vazquez et al. 
(2008) where uniqueness is proved for any matches matrix M, which is not block diagonal (comparison 
multigraph G is connected) and not block anti-diagonal (G is not bipartite). It was shown in Section 
3 that the first condition is necessary for the uniqueness of the least squares solution as well, while the 
second one requires some comments. A block anti-diagonal matches matrix represents a comparison 
structure similar to a ’team tournament’ where the objects (players) have two disjoint subsets (teams) 
such that players in one team do not play against their teammates. Thus the ratings of the players in one 
team can be calculated only through the ratings of the players in the other team and this cyclic feature 
thwarts the convergence of the iteration. An analogous problem will also emerge in our discussion. 

A digraph is an irreflexive directed graph consisting of a finite set of nodes N and a collection of 
ordered pairs P of these nodes. An edge from node Xi to node Xj represents a dominance relation of the 
former over the latter, and is represented by (i, J) G P. In our setting it may be discussed as a ranking 
problem {N,A,M), where N is the set of nodes, the elements of the results matrix A are restricted by 
Oij e { — 1,0,1} and the matches matrix M is defined by rriy = 1 if and only if {(t, j), (j, *)} n P 7 ^ 0. 
Furthermore, aij = 1 [(i,j) € P and (j,i) ^ P], Oy = — 1 [(j,t) G P and (i,j) ^ P], and = 0 

[{(z, j), (j, z)} n P = 0 or (i, j), (j, i) G P], but in the latter case there is a match between objects Xi and 
Xj, therefore rriij = niji = 1. This correspondence is clearly not unique; it can be legitimately argued 
that edges in both directions mean two matches between the associated objects. 

Herings et al. (2005) define the positional power of nodes in digraphs and prove that it can be obtained 
as the limit point of an iterative process. More details about this method will be given later in order to 
show its common roots with our iterative solution for the least squares method. 

Chebotarev (1994) gives a decomposition of the generalized row sum method by the powers of the 
parameter e. Let p,i be the greatest eigenvalue of L. 

Proposition 4.1. For all 0 < e < l//ii the generalized row sum rating vector is: 




. fe =0 


(1 -b smn)s = (1 -b emn)s — eP(l -b emn)s + e^L"^(l -b smn)s — s'^L'^{l + smn)s + ... 


Proof. See Ghebotarev (1994, Property 12). 
In particular, 


□ 


Xi = Si + e 


{mn - di)st + y^ mijSi 


j{e). 


A similar decomposition of the least squares rating is based on Theorem 3.1 and on the Neumann 
series (Neumann, 1877) of [L-b (l/n)J]~^. 

Lemma 4.1. Let B G The following statements are equivalent: 

1. The Neumann series = / -b P -b B^ -b B^ -b ... converges; 

2. All eigenvalues X of B are in the interior of the unit circle, that is, max{|A| : Ay = Py} < 1; 

3. lim„^oo P" = 0. 
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In which case, {I — B) ^ exists, and 


(/ - B)~^ B'^ = I + B + B^ + B^ + ... . 

k=0 

Proof. It is a special case of the theorem for Neumann series in Meyer (2000, p. 618). □ 

In order to decompose the least squares rating vector q, the Neumann series should be applied for 
L+ (1/n) J used in Theorem 3.1. Therefore, some results are necessary about its eigenvalues. According 
to the Gersgorin theorem (Gersgorin, 1931), all eigenvalues of the Laplacian matrix L lie within the 
closed interval [0, 2h], where 0 = maxjdi : i = 1,2 ,..., n} is the maximal number of comparisons with 
the other objects. In the unweighted case 5 < n — 1, and for a round-robin ranking problem 5 = n — 1. 

A regular graph is a graph such that every vertex has the same degree. A semiregular bipartite (or 
biregular) graph is a bipartite graph, for which every two vertices on the same side of the given partition 
have the same degree. 

Lemma 4.2. Let G be a graph with a decreasing degree sequence d = di > d 2 > • ■ • > dn (di = 

and L be the Laplacian matrix of G. Then 

Ml < 20. 


If G is connected, equality holds if and only if G is a regular bipartite graph. 

Proof. In the unweighted case, mi < max{c?(M) -I- d(u)|(u, v) S E{G)} and equality holds if and only if G 
is a semiregular bipartite graph (Anderson and Morley, 1985). It carries over to multigraphs since the 
number of matches between two objects is nonnegative (Mohar, 1991, Theorem 2.2). □ 

Notice that if the comparison multigraph G of the ranking problem {N,A,M) is regular bipartite, 
then M has a block anti-diagonal structure, but the reverse of the implication does not hold. 

Let us introduce the n x n real matrix C with = —lij = niij for all i ^ j and cu = t) — la = 
d — di = d — E is the same as the matches matrix outside the diagonal, where elements are 

increased (except for the object(s) with maximal comparisons) in order to provide that the sum of all 
row (and column) is equal. Then L = dl — C, therefore 

[L + (1/n) J]-^ = [DI-C+ (1/n) J]-^ = J 

In the following, stochastic matrix (l/c))C' is denoted by P. 

Theorem 4.1. Let the comparison multigraph be connected, and not regular bipartite. The unique 
solution q of the least squares problem is 

oo 

q = - P^s = - (s -1- Ps + P^s + P^s -I- ...) . 



Proof. Let A be an eigenvalue of ^ (C — ^J), namely, Ay = (C* — ^J) y for some y. It implies that 

5(1 - A)y = 0 [/ - i (C - i J)] y = {L + i j) y. From Lemma 3.2, 5(1 - A) e {mi, M 2 , • ■ ■, Mn-i, !}■ 
Since Mn-i > 0 also holds, 5(1 — A) >0, thus A < 1. As a consequence of Lemma 4.2, we have 
0(1 — A) < 20, therefore, A > —1. According to the condition of Theorem 4.1, G is connected, the 
equality holds if and only if G is a regular bipartite graph, resulting in the statement: G is not a regular 
bipartite comparison multigraph if and only if A > — 1. 

Hence all eigenvalues fulfil the requirement — 1 < A < 1, Lemma 4.1 can be applied for the matrix 
B — i (G — Aj). By applying the Neumann series on Theorem 3.1, we obtain 


q = 


[L+(l/n)J] 




But Js = 0, which leads to {P 


inJ) 


k 


s = P^s, therefore the assertion holds. 


□ 
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Figure 2: The balanced comparison multigraph of Example 4.1 




For ranking purposes, the multiplier (l/O) > 0 in the decomposition of q is irrelevant. It follows from 
Theorem 4.1 that the least squares solution can be obtained as a limit point of an iterative process. 

Proposition 4.2. Let the comparison multigraph he connected, and not regular bipartite. The unique 
solution of the least squares problem zs q = limfe^co q^^^j where 

q(°) = (l/0)s, 

q(fc) ^ q(fe-i) /c = l,2,.... 

O 

Proof. It is the immediate consequence of Theorem 4.1. □ 

The iteration process has an interpretation on graphs. In the following description, the multiplier 
I/O in the decomposition of q is disregarded for the sake of simplicity. Let G' be a graph identical to 
the comparison multigraph except that d — di loops are assigned for object Xi. With this modification, 
balancedness is achieved with the minimal number of loops, at least one node (with the maximal degree) 
has no loops. Graph G' is said to be the balanced comparison multigraph. It is the same procedure 
as balancing the weighted graph G by loops in Chebotarev (2012, p. 1495), where G' is called the 
balanced-graph of G. Note that G' is connected or regular bipartite if and only if G is connected or 
regular bipartite, respectively. 

Initially, all objects (nodes) are endowed with an own estimation of performance s, corresponding 
to the row sum vector. In the first step, the performance of objects compared with the given one is 
taken into account through the edges. Ps means the average scores of the objects that were compared 
with it (weighted by the number of comparisons, that is, the sum of edges between the two objects). 
The introduction of O — loops on Xi provides that the number of objects reachable on I-long paths is 
exactly 0. Now strength of objects compared with the given one is added to the original scores to get 
s + Ps. 

In the fcth step, the average scores of objects available on all fc-long paths P^s is added to the previous 
rating vector. If G is a connected, and not regular bipartite graph, then this iteration converges to the 
least squares ranking due to Theorem 4.1. Example 4.1 illustrates the decomposition of the least squares 
rating for the ranking problem analyzed in Example 3.1. 

Example 4.1. See the preference graph on Figure 1 and its balanced comparison multigraph on Figure 2. 
It is an undirected graph, the number of loops are determined by the differences O — di, [2, 2,1,1,0, 0,1]. 
Nodes are labelled by the score of the corresponding object. At the start (5q(°)/, every node gets Si. In 
the 1st step (bq^^^/, the scores of nodes reachable on a 1-long path are added with a multiplier l/O = 1/3. 
For example, in the case of Xi it is (2si + saj/O = 2/3. 

In the kth iteration, the scores of nodes reachable on a k-long path are added with a multiplier (1/5)^, 
where the number of scores taken into account is , analogously. It also implies that the denominator 
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in the fraction of the actual rating is a divisor of for all i = 1,2,... ,n. Theorem 4 .I ensures 
that this process converges if the comparison multigraph G is not a regular bipartite graph. 

The rating vectors obtained in the successive steps of the iteration process are as follows 


Iterated ratings 

q(0) 

q(i) 

q(2) 

q(3) 

q(io) 

q(50) 

q 

Xi 

1/3 

5/9 

21/27 

76/81 

1.5075 

1.8057 

1.8095 

X 2 

1/3 

5/9 

17/27 

56/81 

0.6915 

0.4800 

0.4762 

X 3 

0 

2/9 

7/27 

29/81 

0.6178 

0.8069 

0.8095 

X 4 

0 

-2/9 

-5/27 

-19/81 

-0.3535 

-0.5211 

-0.5238 

As 

1/3 

0 

1/27 

-7/81 

-0.2092 

-0.1912 

-0.1905 

Ae 

-1 

-8/9 

-31/27 

-95/81 

-1.4450 

-1.5231 

-1.5238 

A 7 

0 

-2/9 

-10/27 

-40/81 

-0.8090 

-0.8571 

-0.8571 


It immediately shows the role of comparisons. For example, the scores of Xi, X 2 and X^ are equal, how¬ 
ever, their position in the preference graph is significantly different, which can be seen in the subsequent 
steps of the iteration. The final ranking emerges only after the 13th step. 

Example 4.1 suggests two observations. The first is that ties are usually eliminated after taking the 
comparison structure into account, which can be advantageous in practical applications by reducing the 
demand for tie-breaking rules. The second is the possibly slow convergence: in order to get the final 
ranking of the objects, long paths also may be necessary to consider causing some difficulties in the 
interpretation since it is not exactly clear why they still have some importance. Nevertheless, the graph 
of Example 3.1 has few edges relative to a round-robin ranking problem, thus it is not surprising that 
many iteration steps are required. 

Theorem 4.1 has virtually no significance from a computational viewpoint since the least squares 
problem can be solved with a modest cost of O(n^) flops (Jiang et ah, 2011). 

Iterative scoring procedures used for ranking the nodes in a digraph can be traced back to the works 
of Wei (1952) and Kendall (1955), called the long-path method by Laslier (1997). It is based on the right 
eigenvector corresponding to the largest positive eigenvalue of the adjacency matrix. Moon and Pullman 
(1970) shows that the iterative procedure converges to a non-zero vector if the digraph is strongly 
connected, namely, there exists a path from Xi to Xj if Xi 7 ^ Xj. According to Chebotarev (1994) and 
Herings et al. (2005), the severely restricted domain limits the usefulness of this concept. 

This drawback is eliminated by the positional power measure (Herings et ah, 2005). Its rating vector 
p is the limit point of the sequence 

p° =0, 

pfc ^ yAe-i-Ij^Apfe-i^ fc = l,2,.... 

n 

Now the first step (T^e) gives the ’score’ of nodes in a digraph, the number of their successors. Sub¬ 
sequently, each node gets a fraction 1 /n of the previous power of its successors and a fixed amount of 
1. Herings et al. (2005) do not mention the use of the Neumann series explicitly. However, the decom¬ 
position in the proof of Herings et al. (2005, Lemma 4.2) is based on the equation [l — (l/n)T"^] = 

I + (T^)* as limfe^oo(l/?T-)'' = 0. 

Besides these common roots, we have identified three differences between the least squares rating 
and positional power of nodes in digraphs. The first is in the approach of the two ratings. According to 
the concepts of Chebotarev and Shamis (1999), positional power (as well as the Wei-Kendall method) 
is a kind of win-loss combining procedure distinguishing the wins and the losses of objects, while least 
squares is a win-loss unifying procedure, treating all results uniformly. Here the outcomes of paired 
comparisons only appear in the results matrix A, therefore they influence the ranking through s. 

The second difference is the role of iterated ratings: for the positional power they are allocated to 
the predecessors of nodes, whereas for the least squares, they remain on the objects. On the other side, 
positional power adds the original score T^e to the nodes in each step, while the least squares procedure 
uses the hxed scores vector s for the adjustment of previous ratings. 

The third, maybe the most interesting difference is the choice of parameter 1 /a reflecting the impor¬ 
tance of successors in the digraph. Herings et al. (2005) define it somewhat arbitrarily as 1/a = 1/n, 
the reciprocal of the number of nodes in the digraph, however, the procedure works for any nonnegative 
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numbers less than l/(n — 1). It would be interesting to see how this parameter 1/a can be increased. 
The proof of Herings et al. (2005, Theorem 3.1) certainly works if all nodes have less than a successors. 
In such a way the exact definition of a becomes endogenous, similar that of the least squares method, 
and the procedure will depend on the structure of the digraph. 

It was shown in Theorem 4.1 that for the least squares method the decay parameter 1/5 is determined 
endogenously by 5 = maxjfii : i = 1,2 ,... ,n}, the maximal number of comparisons of any objects and 
the iteration process works for all ranking problems {N, A, M) with a connected, and not regular bipartite 
comparison multigraph. The proposal of Herings et al. (2005) can also be applied in Theorem 4.1, as 
the convergence clearly holds if the parameter 1/c in the iteration is smaller than 1/5, which is provided 
if c > m(n — 1). For instance, c = mn is a value analogous to the idea of Herings et al. (2005), but it 
obviously differs from the least squares method. 

Remark 1 . In Theorem 4-1, the decay parameter 1/5 is determined endogenously by the decomposition 
of L = dl — C . If it becomes larger, the convergence of the Neumann series is not ensured, there will 
be more critical cases than regular bipartite graphs. If it is smaller, the iteration converges, however, 
loops will always appear in the balanced comparison multigraph G' and the interpretation becomes more 
complicated. 

The decomposition of the least squares rating works perfectly for regular graphs without loops. They 
are characteristic for some applications, like Swiss-system tournaments, and it is unlikely that such a set 
of comparisons results in a bipartite comparison multigraph. 

Other graph interpretations of the least squares method may be possible on the basis of the sys¬ 
tem of linear equations q = L+s. For example, a topological interpretation was given for L'*" in 
Chebotarev and Shamis (1998b, Theorem 3). 

Finally, it is worth to compare the graph interpretation above with the one given for the generalized 
row sum method by Shamis (1994). The latter calculates the number of fc-long routes with an even and 
odd number of drains (i.e. sequence of edges with some possible loops) between the objects, where e 
represents the importance attributed to indirect connections, that is, the fc-long routes have a weight 
of . It works for all £ < 1/ [2m(n — 1)]. We think that from a graph-theoretic viewpoint the above 
interpretation is more simple, however, the appearance of loops remains a weakness. 


5 Concluding remarks 

We have shown that the least squares ranking method has a graph interpretation with the exception 
of some special cases, when the comparison multigraph is a regular bipartite graph. The rating vector 
can be obtained as the limit of an iteration process based on scores and a decay parameter 1/5, where 
5 = maxjdi : i = 1,2,. ..,n} is the maximal number of comparisons, determined endogenously by the 
matches matrix M. 

Aggregation of the results A = eliminates a lot of information regarding the outcomes of 

paired comparisons, for instance, Oij = can be equal to 0 by adding both 1 and —1 or 0 and 

0. We do not know of any ranking methods which, besides the aggregated expected value a^ , account 
for the variance of , i.e. the bias from the fact that usually a!f^ is not equal to the average / mij. 
However, the difference — aijjmij can carry some information about the comparison of Xi and Xj: 

intuitively, their relative ranking seems to be more stable if ~ aij/niij for all p where r^^'^ is known. 
A possible way of addressing this reliability of the paired comparisons is through an adjustment of the 
number of comparisons between Xi and Xj. 

Since digraphs can be incorporated in our setting, it is interesting to connect the decomposition 
of the least squares method with the positional power of nodes in weighted digraphs (Herings et ah, 
2005). A weighted digraph is defined by the set of nodes and a nonnegative matrix W G where 

Wij > 0 denotes the weight of edge from Xi to Xj. It is natural to choose = Wij/(wij + wji) and 
nT-ij = Wij + Wji but in weighted digraphs edges from a node to itself {wu > 0) are also allowed. It 
remains an open question what is the relation of the two concepts. 

Finally, it follows from Theorem 4.1 that convergence is ensured for all multipliers less than 1/5, 
offering a natural way for the generalization of the least squares method. If G is not a bipartite graph. 
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the parameter can also be increased. Another promising direction may be the change of exponential 
decay. 
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